CDS
Accession Number | TCMCG075C11413 |
gbkey | CDS |
Protein Id | XP_007040446.2 |
Location | complement(join(34799070..34799261,34799403..34799888,34800387..34800533,34801446..34802268,34803063..34803220,34803728..34805233)) |
Gene | LOC18606652 |
GeneID | 18606652 |
Organism | Theobroma cacao |
Protein
Length | 1103aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007040384.2 |
Definition | PREDICTED: protein SPA1-RELATED 2 isoform X2 [Theobroma cacao] |
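The Location and Length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record: it assumes the coordinates are 1-based and inclusive (the usual convention for `join(...)` feature locations), and the helper names `exon_intervals` and `cds_length` are hypothetical. It sums the six joined intervals and compares the codon count with the stated protein length.

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "complement(join(34799070..34799261,34799403..34799888,"
    "34800387..34800533,34801446..34802268,"
    "34803063..34803220,34803728..34805233))"
)


def exon_intervals(location):
    """Return (start, end) pairs in the order they are listed.

    Assumes plain 'start..end' intervals with 1-based inclusive coordinates.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(location):
    """Total number of nucleotides covered by all joined intervals."""
    return sum(end - start + 1 for start, end in exon_intervals(location))


exons = exon_intervals(LOCATION)
total_nt = cds_length(LOCATION)
codons, remainder = divmod(total_nt, 3)

# Prints: 6 3312 1104 0
print(len(exons), total_nt, codons, remainder)
```

Under these assumptions the intervals span 3312 nt, that is 1104 codons, which agrees with a 1103 aa protein plus one stop codon.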
EGGNOG-MAPPER Annotation
COG_category | J |
Description | SPA1-related |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | ko00000, ko00001 |
KEGG_ko | ko:K16240 |
EC | - |
KEGG_Pathway | ko04712, map04712 |
GOs | GO:0000151, GO:0005575, GO:0005622, GO:0005623, GO:0031461, GO:0032991, GO:0044424, GO:0044464, GO:0080008, GO:1902494, GO:1990234 |
Sequence
CDS: ATGGATGGAGGACTATCTGACGAAGTAGCTCCAATAGATGCGGCTGAGGGGACTCACCTCCAGGGTAAAGAGGTTGAGTATTTGATGAAACCTGACAACTGCAACATGTTGGAGTCCCGAGAAGTGGTGATACCGGATGAGGTTAATACTATTGAGAGCTCATTCCAAGTTCTTGGAAATATGTTGGAAGGTAAGAAAGTAAATAGGAGTATAGGTCCTGTAAATGTATCAGAACATGGATGTAGTAGCCCTCGCACTATTGATGATGCAAATGACATGGTTGAAGAGTTAACTGTGAGAAATTACAATGGTTCTAACTTACCCATGGTTGGTACATCAAATAATAGAGAAAGAATGCAGATGAGACAGAACCATTGGCAGCATTTTTATCAGTTGGTGGGTGGATCAGGAAGTGGGGGTTCATGTGGCAACCGGGACAACAGTCAGGCAATGCCAAGTATGTCACAGGATGTAGGGTATGCATCTTTTCTTGAATTTTTGGGTCAGAAACCTTTGAGTGATGGGCGGAATGAAGCGACAGAACATTTGATGAGTGGTGATATTATTGAGGTTTCGGGCAGTCAGCTATCTCATGGAGGTATTAAGACAAAGATTCTATCAAAATCAGGATTTTCTGAATTTTTTGTCAAAACTACATTGAAAGGGAAAGGAGTTATATGTAGAGGTCCATCTCATGATGCCTCAAGAGTTGAGCCTAGAGATCAGAATAACACAAAATCTACTGAGGGAACTATGGTGGCTCCTACTGCTCCACTGAAGGCTGCCGGAAGCCCTGTGGTGGCTTCGAATACGTCACTGATCTTGGTTAACAAAGCCGTTATGACTTCTTCTTCCTATGGGATCATGGGGCCAAGGGCTGGTGAGTGTGATCGTGATGGAATGAACTTGAGAGAATGGCTGAAAGCACAGTGCCATAAAGCAAAAAAATCGGAGTGCTTGTATATATTCAAACAAATTGTGGATCTGGTTGATTATTCTCACTCTCAAGGAGTTATCTTGCATGATTTGCGCCCATCTTTCTTCAAGTTGCTGCAACCTAAACAGGTTAAATATATTGGTTCAGGTGTCCAAAAAGGACTACTAGATACTGTCTTGGATAAAGATATCCCCCCCTCAGAGAACTTTCTGATTAGGAGAAGACCAATGGAGCAGGGGATGATTTCATCAGTTGGCCTCTGTGCTAAAAAGCAGAGGTTCAATGAAAATAAAAACTCGACACGATGGCCTCTCTTTCATTCTAGAGCTGGACCCAAAATTGAAACTGTAAATAATACCCAATTTTCTCACAATGAATCTAGTGAGCATTGTTTTAATACAGAACTTAGCAATTCTGGCAGCCCTTATGCATCTAATTCAGCTCAGCAGCAGTCAATCTCTGTGAATGAGCAGTTGGAAGAGAAGTGGTATGCAAGTCCGGAGGAACTTAATGAAGGAGTTTGCACAATTTCATCAAATATTTACAGTTTGGGCGTGCTGCTTTTTGAGTTACTTGGTCATTTTGAATCTGAGAGAGCACATGCTGCAGCAATGTTAGATCTACGTCATAGGATTTTTCCTCCAACTTTTCTGTCAGAAAATCTCAAGGAAGCTGGATTTTGTCTTCGGCTACTTCATCCAGAACCTTCTTTACGCCCAACAACCAGGGATATCCTACAATCTGAAGTAATTAATGGATTCCAGGAAGTAATTGCAGAAGAATTATCATCATCCATTATCCAAGATGATACCGAATCAGAGCTATTATTGCATTTCCTTAGTTTATTAAAAGAGCAACAGCAGAAGCATGCCTCAAAATTAATGGAGGATATTTCATGCCTAGAAGCAGACATTGAAGAGGTTGAGAGAAGGCGCTGTTCCAGAAAACCTTTAACTTATTCTTCCTGTAATGTGAGAGAATGTAGGCACCTTGGCAAAGAACCTCCAATTTCAGAGGTGCATTCTGGTTTATACCAGCTTTCCAGTGCCAGTGAAATG
AGGTTAATGAGAAATATCAATCACCTCGAAACTGCTTATTTCTCTATGAGATCAAGAGTCCAGTTTCGTGAGACTGATTCAATGACACGGCCAGATAAGGATTTACTTGAAAATCGTGAGAACTGGCATTCGGCTCAAAACAATGAAGAAATACCAAATCCTACTGATAGTCTTGGGGCCTTCTTTGATGGTTTGTGCAAGTATGCTCGATATAGCAAGTTTGAAGTCTGTGGGATACTGAGAAGTGGGGAGTTCAACAACTCTGCAAATGTAATCTGTTCTTTGAGTTTTGACCGTGATGAGGATTATTTTGCCGCTGCTGGGGTCTCTAAGAAAATTAAGATATTTGAGTTTAATGCACTCTTTAATGACTCTGTTGATATTCATTATCCAGTCATTGAGATGTCAAACAAATCAAAGCTTAGCTGTGTTTGCTGGAACAACTATATCAAGAACTATCTGGCTTCAACTGACTATGACGGTCTGGTCAAGTTATGGGATGCAAGCACTGGTCAAGCTGTCTCTCATTTTATTGAGCATGAAAAGAGAGCTTGGTCTGTGGACTTTTCTCGGGTGTATCCAACAAAATTAGCTAGTGGCAGTGATGATTGTTCTGTGAAACTATGGAGCATTAGTGAGAAAAGCTGCTTAGGAACCATCCGGAATATTGCAAATGTCTGCTGCGTTCAGTTTTCTGCCCACTCCACTCATTTACTGGCATTTGGATCTGCGGATTACAAAACATATTGTTATGATCTTCGGAATACCAGAGCACCATGGTGTGTTCTTGGTGGCCATGATAAAGCTGTGAGCTATGTGAAATTCCTGGACTCAGAAACTGTAGTTACTGCTTCCACTGACAACACATTGAAACTTTGGGACCTCAATAAAACCAGTAGTGCTGGCCTGTCCCTTAATGCTTGCAGCTTAACGTTTCGTGGACATACTAATGAGAAGGTTGGCTTTTGCCTCTGGAAAATAGTTTTTTGTTCTTATTATATCTCAACTCTGACAAGACTTTTGTCTTCGTTTGTGTTTGGTTTGACTTTTCACCTGCTTCTCCAGAACTTTGTGGGTTTATCCGCTGCTGATGGTTATATAGCTTGTGGTTCAGAAACAAATGAGGTTTGTGCTTACTATAGATCTCTGCCTATGCCAATCACTTCACATAAATTTGGGTCGATTGATCCTATTTCTGGGAAAGAGACTGATGATGACAACGGGCTGTTTGTATCAAGTGTCTGCTGGAGAGGGAAATCAGACATGGTTGTCGCTGCAAATTCTAGTGGATGTATTAAAGTGTTGCAGATGGTCTAA |
Protein: MDGGLSDEVAPIDAAEGTHLQGKEVEYLMKPDNCNMLESREVVIPDEVNTIESSFQVLGNMLEGKKVNRSIGPVNVSEHGCSSPRTIDDANDMVEELTVRNYNGSNLPMVGTSNNRERMQMRQNHWQHFYQLVGGSGSGGSCGNRDNSQAMPSMSQDVGYASFLEFLGQKPLSDGRNEATEHLMSGDIIEVSGSQLSHGGIKTKILSKSGFSEFFVKTTLKGKGVICRGPSHDASRVEPRDQNNTKSTEGTMVAPTAPLKAAGSPVVASNTSLILVNKAVMTSSSYGIMGPRAGECDRDGMNLREWLKAQCHKAKKSECLYIFKQIVDLVDYSHSQGVILHDLRPSFFKLLQPKQVKYIGSGVQKGLLDTVLDKDIPPSENFLIRRRPMEQGMISSVGLCAKKQRFNENKNSTRWPLFHSRAGPKIETVNNTQFSHNESSEHCFNTELSNSGSPYASNSAQQQSISVNEQLEEKWYASPEELNEGVCTISSNIYSLGVLLFELLGHFESERAHAAAMLDLRHRIFPPTFLSENLKEAGFCLRLLHPEPSLRPTTRDILQSEVINGFQEVIAEELSSSIIQDDTESELLLHFLSLLKEQQQKHASKLMEDISCLEADIEEVERRRCSRKPLTYSSCNVRECRHLGKEPPISEVHSGLYQLSSASEMRLMRNINHLETAYFSMRSRVQFRETDSMTRPDKDLLENRENWHSAQNNEEIPNPTDSLGAFFDGLCKYARYSKFEVCGILRSGEFNNSANVICSLSFDRDEDYFAAAGVSKKIKIFEFNALFNDSVDIHYPVIEMSNKSKLSCVCWNNYIKNYLASTDYDGLVKLWDASTGQAVSHFIEHEKRAWSVDFSRVYPTKLASGSDDCSVKLWSISEKSCLGTIRNIANVCCVQFSAHSTHLLAFGSADYKTYCYDLRNTRAPWCVLGGHDKAVSYVKFLDSETVVTASTDNTLKLWDLNKTSSAGLSLNACSLTFRGHTNEKVGFCLWKIVFCSYYISTLTRLLSSFVFGLTFHLLLQNFVGLSAADGYIACGSETNEVCAYYRSLPMPITSHKFGSIDPISGKETDDDNGLFVSSVCWRGKSDMVVAANSSGCIKVLQMV |
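The CDS and Protein entries can be spot-checked against each other by translating the ends of the coding sequence. The sketch below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1). The `translate` helper and the constant names are hypothetical, and the two fragments are the first 30 and last 18 nucleotides of the CDS entry above.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt):
    """Translate a nucleotide string in frame 0; '*' marks a stop codon.

    Trailing bases that do not fill a whole codon are ignored.
    """
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the CDS entry above.
CDS_START = "ATGGATGGAGGACTATCTGACGAAGTAGCT"  # first 30 nt
CDS_END = "GTGTTGCAGATGGTCTAA"                # last 18 nt

print(translate(CDS_START))  # MDGGLSDEVA
print(translate(CDS_END))    # VLQMV*
```

Both results match the start (MDGGLSDEVA) and end (VLQMV) of the Protein entry, with the final codon TAA acting as the stop.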